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SEARCH PROCESSING METHOD AND APPARATUS 

TECHNICAL FIELD OF THE INVENTION 

5 This invention relates to search technology for a document 

database, and more particularly to a user interface when the search is 
carried out. 

BACKGROUND OF THE INVENTION 

10 

Hitherto, when features, similarities or the like of sentences 
included in a document group were analyzed, for example, the document 
group was processed by using the text mining technology, search 
technology with weighting calculation, or the like, and the processing 
15 result was converted into a table or a graph to present it to an analyst. 
In the conventional technology, the table, graph or diagram was 
individually generated and displayed based on the original data in the 
document group. 

However, there is a case in which the analyst would like to obtain 
20 the following analysis result from a certain analysis result when 
progressing the analysis. It was possible for the conventional 
. technology to carry out a simple processing such as simply referring 
to a portion of the search result, and narrowing the search result, but 

a pertinent portion was merely displayed and any changes were not given 

v 

25 to the meanings of the search results. Furthermore, when only an 
interesting portion in a certain table is displayed with a new viewpoint 
or the portion is converted into a diagram or the like in another 
viewpoint to display it, for instance, the following process was needed: 
that is, 1) return to the original data in the document group once, 2) 

30 set a search condition again, 3) carry out a processing by using 
numerical values, text, or the like in the data of the document group, 
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and 4) convert the processing result into a desired display form. 
Therefore, in such a case , because it took much time and work, the analyst 
had to be confronted with a big hurdle when he or she would like to make 
a consideration for the following analysis from the analysis result at 
5 once . 

In addition, JP-A-20 0 1-92 8 51 discloses following technology. 
That is, a method is disclosed in which after data of a search file 
obtained from a patent/technology document search system is 
automatically converted by a computer into patent information or 

10 technology information analysis master table, a patent map or various 
aggregation graph in a necessary form is output. However, even by this 
technology, in a case where the data of the search file must be changed, 
it must be separately converted into another master table. Therefore, 
in a case where the previous search result is narrowed, it is necessary 

15 to prepare new search file as usual, and any processing and work are 
not simplified. 

SUMMARY OF THE INVENTION 

20 Therefore, an object of this invention is to provide the search 

processing technology to enable a searcher to intuitively analyze the 
search result. 

A search processing method according to this invention comprises 
the steps of: searching a predetermined document group according to a 

25 search condition specified by a user, and storing data of a plurality 
of documents extracted by the search into a storage device; transforming 
the data of the plurality of documents extracted by the search into data 
to indicate the data of the plurality of documents to the user in a first 
display form and to enable the user to select each display item as a 

30 following processing key, and outputting the transformed data; 
extracting data of documents corresponding to the display items directly 



or indirectly selected by the user; and transforming the data of the 
documents corresponding to the selected display items into data to 
indicate the data of the documents to the user in a second display form 
specified by the user and to enable the user to select each display item 
5 as a following processing key, and outputting the transformed data. 

Thus, the data of the plural documents extracted by the search 
is indicated as following processing keys and selectable display items 
to the user in the first display form, and in response to the selection 
of display items by the user, the data of the documents corresponding 

10 to the selected display items is indicated as further following 
processing keys and selectable display items to the user in the second 
display form specified by the user. Therefore, it becomes possible to 
change the analysis object and display manner via the intuitive 
interface and to progress the analysis. 

15 Moreover, each of the aforementioned first and second display 

forms may be at least either of (a) a form indicating the plurality of 
documents to be processed that are clustered by used words, by a 
predefined display matter, (b) a form indicating the plurality of 
documents to be processed, by a predefined display matter and a 

20 connection line representing a degree of relevancy between the plurality 
of documents, that is calculated by used words, (c) a form indicating 
a result in which the plurality of documents to be processed are 
classified and aggregated as to used words, by a graph, (d) a form 
indicating used words in the plurality of documents to be processed and 

25 a connection line representing a degree of relevancy among the used words, 
(e) a form indicating a document group associated by a specific matter 
among the plurality of documents to be processed, by the specific item, 
and a degree of relevancy between the document group associated by the 
specific item and each used word in the plurality of documents to be 

30 processed, by a connection line between the specific matter and each 
used word. For example, the transition among these display forms is 



freely carried out, and the analysis can be progressed. Incidentally, 
this invention is not limited to these display forms, and other display 
forms may be included- Moreover, an arbitrary display form may be 
selected in another combination. 
5 In addition, in the aforementioned first and second transforming 

steps, a display program corresponding to the display form may be 
designated and data for the display program may be generated . The display 
program is switched in conformity with the environment of the user 
terminal as the occasion demands. Moreover, this invention can be 
10 realized in any of the stand-alone environment and client-server 
environment . 

Incidentally, the aforementioned method may be carried out by 
a combination of a program and computer hardware, and the computer works 
as a search processing apparatus. In addition, the aforementioned 
15 program is stored in a storage medium or storage device such as a flexible 
disk, CD-ROM, magneto-optical disk, semiconductor memory, and hard disk. 
Moreover, it may be distributed via a network as a digital signal. 
Incidentally, an intermediate processing result is temporarily stored 
into a storage device such as a main memory. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a functional block diagram in one embodiment of this 
invention ; 

25 Fig. 2 is a drawing showing a main processing flow in the 

embodiment of this invention; 

Fig. 3 is a drawing showing one example of the data stored in 
a document DB; 

Fig. 4 is a drawing showing one example of data stored in a related 
30 word data storage; 

Fig. 5 is a drawing showing a screen example presenting a first 
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search result; 

Fig. 6 is a drawing showing a processing flow to display a first 
display form; 

Fig. 7 is a drawing showing the first display form; 
5 Fig. 8 is a drawing showing one example of a display form 

designation column in the first display form; 

Fig. 9 is a drawing showing the first display form; 
Fig. 10 is a drawing showing a menu in the first display form; 
Fig. 11 is a drawing showing a main processing flow in the 
10 embodiment of this invention; 

Fig. 12 is a drawing showing a processing flow to display a second 
display form; 

Fig. 13 is a drawing showing one example of data stored in a 
relevancy degree data storage; 
15 Fig. 14 is a drawing showing the second display form; 

Fig- 15 is a drawing showing a third display form; 

Fig. 16 is a drawing showing the second display form; 

Fig. 17 is a drawing showing a fourth display form; 

Fig. 18 is a drawing showing a processing flow to display a fifth 
20 display form; 

Fig. 19 is a drawing showing the fifth display form; 

Fig. 20 is a drawing showing a sixth display form; 

Fig. 21 is a drawing showing the fifth display form; 

Fig. 22 is a drawing showing a menu in the fifth display form; 
25 Fig. 23 is a drawing showing a processing flow to display a seventh 

display form; 

Fig. 24 is a drawing showing the seventh display form; 
Fig. 25 is a drawing showing a processing flow to display an eighth 
display form; and 
30 Fig. 26 is a drawing showing the eighth display form. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



Fig. 1 shows a system outline figure according to one embodiment 
of the present invention. For instance, a search server 5 that has a 
5 web server function, for example, and one or a plurality of client 
terminals 3 that are personal computers and have a web browser function 
are connected with a network 1 that is the Internet or Local Area Network 
(LAN), for example. 

The search server 5 manages a document DB 53 that stores documents 

10 to be searched, relevancy degree data storage 54 that stores data or 
the like concerning the degree of relevancy between documents extracted 
by the search or the like, and related word data storage 55 that stores 
data of used words or phrases (hereafter, simply called "words") in the 
documents extracted by the search or the like. Moreover, the search 

15 server 5 includes a search processor 51 that carries out a search 
processing to the document DB 53, and data operating unit 52 that 
processes data or the like of the documents extracted by the search 
processor 51 by a predetermined algorithm. Incidentally, the search 
server 5 may achieve the functions explained below by not only one 

20 computer but also plural computers. Moreover, the search server 5 may 
communicate with the client terminals 3 via an interface other than the 
web server function by conforming itself with the function of the client 
terminals 3 . 

Input devices 32 such as a mouse and keyboard, and a display device 
25 31 such as a display or the like are connected to the client terminal 
3, and the client terminal 3 communicates data with the search servers 
5, and has a client search interface 33 that provides a user interface 
to the user of the client terminal 3, and one or a plurality of display 
processors 34 that carry out a display processing when a table, graph, 
30 flow diagram of the search result, or the like is displayed and provide 
an user interface for the table, graph, flow diagram or the like. When 



the data is communicated with the search servers 5 based on the web 
technology, the client search interface 33 is a web browser. However, 
it is also possible to use a dedicated client program. 

Next, the processing content in the system shown in Fig. 1 will 
5 be explained by using Fig. 2 to Fig. 26. Incidentally, the search of 
the patent documents will be explained as an example in the following 
descriptions. First of all, a user operates the input device 32 of the 
client terminal 3 to give an instruction to the client search interface 
33, and causes it access a search condition designation page (step SI) . 

10 The search server 5 transmits the data of the search condition 
designation page to the client terminal 3 in response to the access by 
the client terminal 3 (step S3) . The client search interface 33 of the 
client terminal 3 receives the data of the search condition designation 
page from the search server 5, and displays it on the display device 

15 31 (step S5) . For instance, a screen including at least a keyword input 
column, a column for specifying the database (s) to be searched, and the 
like is displayed. According to circumstances, sentences may be input 
instead of the keywords to use them as a search condition. In this case, 
the search words are extracted from the sentences by the morphemic 

20 analysis or the like. The user inputs the search keyword and the like, 
and specifies the database (s) to be searched. 

The client search interface 33 of the client terminal 3 accepts 
the search condition input such as the search keyword from the user (step 
S7) , and transmits the search condition input data to the search server 

25 5 (step S9) . The search processor 51 of the search server .5 receives 
the search condition input data from the client terminal 3, and 
temporarily stores it into the storage device (step Sll) . Then, it 
searches the document DB 53 according to the search condition such as 
the search keyword, and extracts the data of the pertinent documents 

30 (step S13) . 

Data as shown in Fig . 3 is stored in the document DB 53 , for example . 
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A table example in Fig. 3 includes a column 201 of a record number of 
a document, column 202 of an application number, column 203 of a filing 
date, column 204 of a publication number, column 205 of a publication 
date, column 206 of a registration number, . . . column 207 of an applicant, 
5 column 208 of a title, column 209 of an abstract, and the like . In addition 
to the aforementioned data, data of the text is stored. Referring to 
such data, the search is carried out. 

Then, the data operating unit 52 carries out a predetermined 
processing- for the extracted data (step S14) . The predetermined 

10 processing is a processing in which a degree of relevancy of each 
extracted document is calculated for the search keyword or the like, 
for example, and the extracted documents are rearranged according to 
the degrees of relevancy. In addition, it is a processing in which the 
degrees of relevancy between the extracted document and words being used 

15 are calculated, and the extracted documents are arranged in descending 
order of the degree of relevancy. Incidentally, the processing in which 
the degree of relevancy is calculated to arrange according to the degree 
of relevancy is the same as in the conventional art, and a method is 
well-known in which words in the document are weighted using "TFIDF" 

20 and/or "information volume of Kullback", and then, the document is 
represented as a vector by using the weights of the words, and the degree 
of relevancy of the document is calculated by* the scalar product. 
Moreover, JP-A-20 0 0-3 152 07 and JP-A-2002-245061 also disclose the same 
kinds of methods, for instance. These processing results are held in 

25 a work memory area in the search server 5. 

Incidentally, as for words used in each document, words that have 
a high degree of relevancy may be specified as related words in advance, 
and they may be registered into the related word data storage 55 to use 
them for the aforementioned processing. For instance, data as shown in 

30 Fig. 4 is registered in the related word data storage 55. In an example 
of Fig. 4, related words are enumerated so as to correspond to the 



publication number. 

Then, the search server 5 generates data of a search result 
display page including the processing result at the step S14, and 
transmits it to the client terminal 3 (step S15) . The client search 
5 interface 33 in the client terminal 3 receives the data of the search 
result display page from the search server 5, and displays it on the 
display device (step S17) . 

For instance, a screen as shown in Fig. 5 is displayed on the 
display device 31. In an example of Fig. 5, the screen includes a keyword 

10 input column 301, clear button 302, search re-execution button 318, 
related word display column 303 to display words deeply related to the 
input keyword in order of rankings, association dictionary designation 
column 304 to specify database (s) that stores documents to be searched, 
text set designation column 305 to specify the entire or subset of the 

15 database (s) specified in the association dictionary designation column 
304, word display column 316 to display (the number of related words 
in the document group in which the input keyword occurs / the number 
of related words in all of the document groups) , text display column 
317 to display (the number of documents in which the input keyword occurs 

20 / the total number of documents) , command designation column 307 to 
select a display form such as a table generation, graph generation, flow 
generation, or the like, command button 306 to cause to carry out a 
display in the display form specified in the command designation column 
307, vertical axis setting column 308 to carry out a setting of the 

25 vertical axis in the graph and/or to specify a related word type, 
horizontal axis setting column 309 to carry out a setting of the 
horizontal axis in the graph or the like, button 310 to return a value 
in the vertical axis setting column to the default, button 311 to return 
a value in the horizontal axis setting column to the default, word 

30 pattern input column 312 to search for a compound word, button group 
313 (on/of f /reverse) that is selected when narrowing search is carried 



out, radio button 314 for a condition (AND/OR) of the narrowing, and 
display column 315 to display documents extracted by the search. 

As shown in Fig. 5, the search keywords input at the step S7 are 
"software" and "information processing", the database "patented 
5 publication excerpts" is used, and the search is carried out for the 
whole of the database. In this case, the number of related words in the 
document group in which the input keywords occur is 1353, and the number 
of related words in all of the document groups is 29220. Moreover, the 
number of documents in which the input keywords occur is 60, and the 

10 total number of documents is 2752 . The words associated with the document 
group including "software" and "information processing" are displayed 
in the related word display column 303 in order of the ranking of the 
degree of relevancy, and the document group including "software" and 
"information processing" is displayed in the document display column 

15 315 in order of the ranking of the degree of relevancy. 

The user specifies the display form of the search result in the 
command designation column 307 on this screen, and clicks the command 
button 306. For instance, it is assumed that "table generation" was 
selected as shown in Fig. 5. The client search interface 33 in the client 

20 terminal 3 accepts the command input as to the display form by the user, 
and transmits a command as to the display form to the search server 5 
(step S19) . The data operating unit 52 in the search server 5 receives 
the command as to the display form from the client terminal 3 (step S21) , 
generates display data to achieve the designated display form by using 

25 the data extracted at the step S13, and stores it into the work memory 
area (step S23) . 

The data operating unit 52 clusters the extracted documents by 
using the extracted data (here, which includes the abstract, claims, 
other texts, and bibliographic information in 60 documents) (step S31 

30 in Fig. 6) . As for this clustering, the degree of relevancy is calculated 
for each related word of the extracted documents, and the grouping of 
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words having a high degree of relevancy is carried out. In addition, 
the degrees of relevancy between the words categorized into each group 
and each document are calculated, and each document is associated with 
a group (i.e. cluster) with a high degree of relevancy. Incidentally, 
5 as for this clustering calculation, a method is typically known in which 
each word in the documents is weighted by using "TFIDF" and/or 
"information amount of Kullback", then, the document is represented as 
a vector by using the weights of the words, the degree of relevancy of 
the document is calculated by the scalar product of the vector, and a 
10 clustering algorithm is applied to a relevancy degree matrix obtained 
by calculating the degrees of relevancy among all of the documents. For 
instance, as for the details, see the aforementioned patent 
publications . 

In the example of Fig. 5, the following clusters are generated. 
15 1st cluster: form, image, limitation, identification 

information, goods, delivery, identification 

2nd cluster: name, version number, version, reference relation 
information, reference relation, change, version management 

3rd cluster: detachable, information management method, 
20 information management, ID, user, software information, user ID 

4th cluster: instruction, comparison, address register, 
execution, address, region, debugging 

5th cluster: e-mail, electric, mail, transmission, generation, 
reception, terminal 
25 6th cluster: version, version upgrade, up, equipment, version 

information, serial, date 

The document included in a specific cluster is a document having a high 
relevancy with the related words included in the specific cluster. 

Incidentally, the degree of relevancy of a certain word with a 
30 specific document is calculated as follows : (appearance frequency of the 
certain word or phrase in the specific document) / (the number of 
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documents in which the certain word appears in the database (patented 
publication group in the above example) ) This is defined in a viewpoint 
in which the word that frequently appears in the specific document is 
a feature word of the specific document, and on the other hand, the word 
5 that appear in a lot of documents can not be said as a feature word in 
the specific document. However, it is also possible to calculate the 
degree of relevancy by other methods . 

Next, the data operating unit 52 specifies display data item of 
each document based on the command as to the specified display form (step 

10 S33) . Here, the publication number, title, and the applicant name are 
set in advance as the display data items, and such data is specified 
for each document. Then, the aforementioned display data item for each 
document is arranged for each cluster (step S35) . Incidentally, in this 
embodiment, the vertical axis represents the cluster and the horizontal 

15 axis represents the applicant. That is, each document is classified by 
a set of the cluster and applicant. However, it is also possible to use 
another data as to the horizontal axis. Finally, it generates display 
data in a data format which a specific display processor 34 in the client 
terminal 3 can process, and stores it into the work memory area (step 

20 S37) . The specific display processor 34 is selected so as to correspond 
to the command "table generation" . 

The data operating unit 52 carries out the aforementioned 
processing, and transmits the generated display data to the client 
terminal 3 (step S25 in Fig. 2) . The client search interface 33 of the 

25 client terminal 3 receives the display data from the search server 5 
(step S27) . Then, the client search interface 33 confirms whether the 
display processor 34 corresponding to the received display data is 
activated, and the display processor 34 corresponding to the received 
display data is activated if it has not been activated. The display 

30 processor 34 displays the search result on the display device 31 by using 
the received display data in the display form designated by the user 
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(step S29) . Incidentally, the processing shifts to Fig. 11 through 
terminals A and B. 

A screen example is shown in Fig . 7 . The example of Fig . 7 includes 
a display form selection column 401 and table 410. The display form 
5 selection column 401 is a combo box to select the following display form. 
For instance, either the patent map (Fig. 7), patent flow (Fig. 14), 
graph (Fig. 19), skeleton map (Fig. 24) or anchor map (Fig. 26) can be 
selected as shown in Fig. 8. The table 410 includes a cluster column 
402 to display data of each cluster, column 403 to display pertinent 

10 documents (10 documents) of company A to each cluster, column 404 to 
display pertinent documents (8 documents) of company B to each cluster, 
column 405 to display pertinent documents (6 documents) of company C 
to each cluster, and column 406 to display pertinent documents (5 
documents) of company D to each cluster . As described above, the vertical 

15 axis is an axis for the applicants. On the other hand, the table 410 
includes a row 407 of the 3rd cluster, row 408 of the 4th cluster, and 
row 409 of the 5th cluster. Incidentally, as for the lines of the 1st 
and 2nd clusters, though they are not shown because of the lack of the 
screen space, they can be displayed by operating the scroll bar or the 

20 like. Thus, the user can analyze that applications by which applicant 
are included in which cluster more. Moreover, it can be grasped that 
which applicant files a lot of applications in the search result. 

Incidentally, each of the display items including a set of the 
publication number, title, and applicant name can be selected for the 

25 following processing. Moreover, it is also possible to carry out the 
region designation, and the display items included in the region are 
selected. For instance, as shown in Fig. 9, 10 display items (documents) 
included in the row 408 of the 4th cluster and the column 403 of the 
company A, column 404 of the company B, and column 405 of the company 

30 C can be selected by designating a region 415. Incidentally, it is 
possible to select the display item (document) one by one. It is possible 
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to adopt a display form in which each display item is a button, for 
instance, and can be clicked. 

In the example of Fig. 9, it is assumed that the display items 
(documents) included in the region 415 were selected, and "patent flow" 
5 was selected as the following display form in the display form selection 
column 401. Then, when the user pushes a right button' of the mouse that 
is the input device 32, a menu shown in Fig. 10 is displayed. The example 
of Fig. 10 includes the following menu items: "activate external 
command" 421, environmental setting, copy, paste, paste with type 

10 designation, insert, delete, clear numerical expression and value, 
insert comment, format setting of cell , select from list, and hyperlink . 
Here, the "activate external command" 421 is selected. Incidentally, 
in this embodiment, it is considered that all display items are tacitly 
selected, if any display item is not selected. 

15 Then, the display processor 34 accepts the selection input of 

the display form and display items from the user (step S41 in Fig. 11) , 
and outputs data concerning the selected display items and display form 
to the client search interface 33 of the client terminal 3. 

The client search interface 33 transmits the data concerning the 

20 selected display items and display form to the search server 5 (step 
S43) . The data operating unit 52 of the search server 5 receives the 
data concerning the selected display items and display form, and 
temporarily stores it into the work memory area (step S45) . Then, it 
extracts the data of the documents corresponding to the selected display 

25 items from the document DB 53, for instance (step S47) . When the previous 
processing result has still been held in the work memory area, it is 
also possible to use the data. Here, because the display item includes 
at least the publication number, the data of ten documents can be 
specified from the publication number. Then, it generates display data 

30 to achieve the specified display form (here, "patent flow") by using 
the extracted data, and stores the display data in the work memory area 
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(step S49) . 

The data operating unit 52 calculates the degree of relevancy 
between the selected documents by using the data of the documents 
corresponding to the selected display items (step S61 in Fig. 12). 
5 Because the calculation of the degree of relevancy between the selected 
documents is also well-known technology, it is not explained here. 
Incidentally, for instance, data as shown in Fig. 13 is generated and 
is stored into, for instance, the relevancy degree data storage 54. In 
an example of Fig. 13, 10 selected documents are arranged vertically 

10 and horizontally in the same manner, and the degrees of relevancy with 
the other 9 documents are registered to one document. However, for 
example, "-" is indicated in a case where the degree of relevancy is 
equal to or lower than a predefined threshold. 

Then, it specifies display data items from the data of each 

15 document (step S63) . Here, the publication number, title, and applicant 
are set in advance as the display data items, and such data is specified 
for each document. Moreover, it generates display data in a state where 
display items including the display data items are arranged in the time 
series according to the filing date or the publication date included 

20 in the data of the documents, and the display items are connected by 
segments with the thickness according to the degree of relevancy in a 
format which the display processor 34 for the patent flow display can 
process (step S65) . 

Then, the search server 5 transmits the generated display data 

25 to the client terminal 3 (step S51 in Fig. 11). The client search 
interface 33 of the client terminal 3 receives the display data from 
the search server 5 (step S53) . Then, it confirms whether the display 
processor 34 corresponding to the received display data has been 
activated. The display processor 34 is activated if it has not been 

30 activated. Then, it outputs the display data to the display processor 
34. The display processor 34 corresponding to the display data displays 
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the display data on the display device 31 in the display form designated 
by the user (step S55) . Incidentally, when another searching is carried 
out, the processing shifts to Fig. 2 through a terminal C. On the other 
hand, the processing returns to the step S41 when the display form and 
5 the display items are selected to progress the analysis as explained 
below (step S57) . 

For instance, a screen as shown in Fig. 14 is displayed. In an 
example of Fig. 14, 10 display items 501 to 510 are arranged from the 
left to the right in the time series according to the publication date 

10 or filing date. Moreover, a segment connects between the display items 
that have the degree of relevancy equal to or more than a predetermined 
threshold, and the segment between this display items corresponding to 
the documents with a higher degree of relevancy is shown thicker. For 
instance, because the display item 502 and display item 503 have the 

15 highest degree of relevancy in the 10 display items, they are connected 
by the thickest segment. Moreover, the display item 502 and display item 
510, the display item 505 and display item 506, and the display item 
505 and display item 509 also have a comparatively high degree of 
relevancy, and they are respectively connected by a medium thick line. 

20 Incidentally, all of these display items are selectable, and they may 
be buttons. Moreover, one or plural display items may be selected by 
specifying a region. 

Moreover, not only data concerning the connection relation 
between the display items and thickness but also the actual degree of 

25 relevancy can be held to the segment that connects between the display 
items. In this case, for instance, the display as shown in Fig. 15 may 
be carried out by providing a display switch button in addition to, for 
instance, the display portion in Fig . 14, and clicking the display switch 
button. In Fig. 15, the degree of relevancy is displayed near the 

30 connection line between the display items being displayed. The display 
item is spaced with other display items to display the numerical value 
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of the degree of relevancy near the connection line. In the screen like 
Fig. 15, each display item is selectable, and one or plural display items 
can be selected. 

For instance, as shown in Fig. 16, it is assumed that the user 
5 specified a region 515 so as to include the display items 506 to 510, 
"patent flow" was selected again in the display item designation column 
(for instance, Fig. 7) provided outside the displayed portion in Fig. 
16. Then, it is assumed that the menu like Fig. 10, for instance, is 
further displayed by pushing the right button of the mouse, which is 

10 the input device 32, and "activate external command" 421 is selected. 

Thus, the display processor 34 of the client terminal 3 accepts 
the selection input concerning the display form and display items from 
the user (step S41 in Fig. 11), and outputs the data concerning the 
selected display form and display items to the client search interface 

15 33. The client search interface 33 receives the data concerning the 
selected display form and display items, and transmits it to the search 
server 5 (step S43) . 

The data operating unit 52 of the search server 5 receives the 
data concerning the selected display items and display form, and 

20 temporarily stores it into the work memory area (step S45) . Then, it 
extracts the data of the documents corresponding to the selected display 
items from the document DB 53, for instance (step S47) . If the previous 
processing result has still been held in the work memory area, it is 
also possible to use the data. Here, because the display item includes 

25 at least the publication number, the data of 5 documents can be specified 
from the publication number. Then, it generates display data to achieve 
the designated display form (here, "patent flow " ) by us ing the extracted 
data, and stores it in the work memory area (step S49) . That is, the 
processing shown in Fig. 12 will be executed for the data of the 5 

30 documents . 

Then, the search server 5 transmits the generated display data 
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to the client terminal 3 (step S51) . The client search interface 33 of 
the client terminal 3 receives the display data from the search server 
5 (step S53) . Then, it confirms whether the display processor 34 
corresponding to the received display data has been activated. The 
5 display processor 34 is activated if it has not been activated, yet. 
Then, it outputs the display data to the display processor 34. The 
display processor 34 corresponding to the display data displays the 
display data on the display device 31 in the display form designated 
by the user (step S55) . 

10 For instance, the screen shown in Fig. 17 is displayed. In Fig. 

17, 5 display items are arranged in the time series according to the 
publication date or filing date. Each display item is selectable, and 
the segment with the thickness according to the degree of relevancy 
connects between the documents corresponding to the display items. 

15 However, the connection relation in the region 515 of Fig. 16 and the 
connection relation in Fig. 17 are apparently different. This is because 
there is a difference between the degree of relevancy among 5 documents 
and the degree of relevancy among 10 documents. Thus, because the region 
515 is not merely enlarged and displayed, but the degree of relevancy 

20 is re-calculated among the selected documents in this embodiment, it 
becomes possible to show the relationship among only the selected 
documents. In Fig. 17, it is also possible to specify the display items 
and display form to progress the analysis. Moreover, when any display 
items are not specified, it is assumed that all of the display items 

25 are selected to carry out the processing. 

Moreover, it is assumed that "graph" was selected in the display 
form designation column (for instance, Fig. 7) provided outside the 
display portion of Fig. 15 without selecting any display items, for 
instance, in the state shown in Fig. 15. In addition, it is also assumed 

30 that the menu like Fig. 10, for instance, was displayed in response to 
the push of the right button of the mouse, which is the input device 
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32, and "activate external command" 421 was selected. As described above, 
it is assumed that all of the display items were selected when any display 
item was not selected. 

Thus, the display processor 34 of the client terminal 3 accepts 
5 the selection input concerning the display form and display items from 
the user (here, the display items were selected indirectly) (step S41 
in Fig. 11) , and outputs the data concerning the selected display form 
and display items to the client search interface 33. The client search 
interface 33 receives the data concerning the selected display form and 

10 display items, and transmits it to the search server 5 (step S43) . 

The data operating unit 52 of the search server 5 receives the 
data concerning the selected display items and display form, and 
temporarily stores it into the work memory area (step S45) . Then, it 
extracts the data of the documents corresponding to the selected display 

15 items from the document DB 53, for instance (step S47) . In a case where 
the previous processing result has still been held in the work memory 
area, it is also possible to use the data. Here, because the display 
item includes at least the publication number, the data of 10 documents 
can be specified from the publication number. Then, it generates display 

20 data to achieve the designated display form (here, "graph") by using 
the extracted data, and stores it into the work memory area (step S49) . 

The data operating unit 52 reads out the data of related words 
of the documents (Hereafter, it is also called "selected documents") 
corresponding to the selected display items from the related word data 

25 storage 55 (step S71 in Fig. 18) . Then, it selects a predetermined number 
of related words with a high degree of relevancy with the selected 
document group (step S73) . Because it is well-known technology as for 
the calculation of the degree of relevancy between the documents and 
words, the detailed explanation is omitted, here. Then, it classifies 

30 the selected documents according to the related words, aggregates the 
number of documents in each class based on a specific issue (here, filing 
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year) , and stores the result data into the work memory area, for example 
(step S75) . Here, as to each filing year, it counts up the number of 
documents including the related word for each related word. Then, it 
generates display data to display the aggregation result in the graph 
5 is with a format that the display processor 34 for the graph display 
can process, and stores it into the work memory area, for instance (step 
S77) . 

Then, the search server 5 transmits the generated display data 
to the client terminal 3 (step S51 in Fig. 11) . The client search 

10 interface 33 of the client terminal 3 receives the display data from 
the search server 5 (step S53) . Then, it confirms whether the display 
processor 34 corresponding to the received display data has already been 
activated. The display processor 34 is activated if it has not been 
activated, yet. Then, it outputs the display data to the display 

15 processor 34. The display processor 34 corresponding to the display data 
displays the display data on the display device 31 in the display form 
designated by the user (step S55) . 

For instance, a screen as shown in Fig. 19 is displayed. The screen 
example of Fig. 19 includes a display form designation column 601 and 

20 selection column 602 to select the filing year or the applicant, and 
the vertical axis represents the number of cases (the number of 
documents) , and the horizontal axis represents the filing year. In this 
example, 15 related words are displayed as display items, and one or 
plural related words are selectable. That is, words or phrases such as 

25 "peripherals" and "automatic operation" are selected regardless of the 
filing year. 

Moreover, the display can be changed to a graph as shown in Fig. 
20 by changing the choice in the selection column 602 from "filing year" 
to "applicant". In the example of Fig. 20, there is no change as to the 
30 related words that are the display items. However, because the 
horizontal axis is changed from "filing year" to "applicant", the number 



of documents including each related word is aggregated for each 
applicant to display it. Even in Fig. 20, one or plural related words 
that are the display items are selectable. 

For example, in the graph whose horizontal axis is set to the 
5 filing year, it is assumed that "skeleton map" in the display form 
designation column 601 as shown in Fig. 21 was selected, and 
"peripherals" 615 of the related word that is the display items was 
further selected. In Fig. 21, the thick line or the like indicates that 
the item has been selected. In addition, when the right button of the 

10 mouse that is the input device 32 is pushed, a menu as shown in Fig. 
22 is displayed. In the menu of Fig. 22, choices "series selection" 620 
and "setting of environment", "former data", and "clear" are provided. 
It is assumed that the series selection 620 was selected, here. 

Thus, the display processor 34 of the client terminal 3 accepts 

15 the selection input concerning the display form and display items from 
the user (step S41 in Fig. 11), and outputs the data concerning the 
selected display form and display items to the client search interface 
33. The client search interface 33 receives the data concerning the 
selected display form and display items, and transmits it to the search 

20 server 5 (step S43) . 

The data operating unit 52 of the search server 5 receives the 
data concerning the selected display items and display form, and 
temporarily stores it into the work memory area (step S45) . Then, it 
extracts data of the documents corresponding to the selected display 

25 items from the document DB 53, for instance (step S47) . In a case where 
the previous processing result has still been held in the work memory 
area, it is also possible to use the data. For instance, because the 
display item is "peripherals" here, it specifies the documents, to which 
"peripherals" is registered as a related word, based on the data stored 

30 in the related word data storage 55 by the publication number. Then, 
it extracts data of the documents from the publication numbers. It 



generates display data to achieve the designated display form (here, 
"skeleton map") by using the extracted data, and stores it into the work 
memory area (step S49) . 

The data operating unit 52 calculates the degree of relevancy 
5 between words for each related word (words other than "peripherals", 
here) that is the data of the extracted document, and stores it into 
the work memory area, for instance (step S81 in Fig. 23) . Incidentally, 
in a case where the degree of relevancy is equal to or lower than a 
predetermined reference, the degree of relevancy is regarded as 0 . It 

10 is also possible to store this data in the relevancy degree data storage 
54. Then, it generates display data in the display form in which each 
word is treated as a display item and the segment between words has the 
thickness according to the degree of relevancy between the words, in 
a format which the display processor 34, which displays data in the 

15 skeleton map, can process, and stores it into the work memory area, for 
instance (step S83) . 

Then, the search server 5 transmits the generated display data 
to the client terminal 3 (step S51 in Fig. 11). The client search 
interface 33 of the client terminal 3 receives the display data from 

20 the search server 5 (step S53) . Then, it confirms whether the display 
processor 34 corresponding to the received display data has been 
activated. The display processor 34 is activated if it has not been 
activated, yet. Then, it outputs the display data to the display 
processor 34. The display processor 34 corresponding to the display data 

25 displays the display data on the display device 31 in the display form 
designated by the user (step S55) . 

For instance, a screen as shown in Fig. 24 is displayed. In Fig. 
24, each related word is a display item, and it is possible to select 
one or plural display items. Moreover, in this embodiment, it is judged 

30 that all of the display items are selected when any display items are 
not selected. In addition, by specifying a region, it is also possible 
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to specify the display items (related words) included in the region. 
In the example of Fig. 24, the related words connected via the thick 
segment are deeply related, and the degree of relevancy lowers when the 
thickness of the segment becomes thin from a medium thick line toward 
5 a thin line. Incidentally, as for the words not connected, it represents 
that the degrees of relevancy is equal to or lower than a predetermined 
reference . 

Though it is not show in Fig. 24, the display form designation 
column to shift from this skeleton map to the following display form 

10 is provided. Therefore, after the user selects the following display 
form in the display form designation column, and further selects display 
items, the menu screen as shown in Fig. 10, for instance, is displayed, 
when he or she pushes the right button of the mouse that is the input 
device 32. When "activate external command" is selected here, the 

15 selection input concerning the display form and display items are 
carried out. Here, it is assumed that all of the display items were 
selected, and "anchor map" was selected as a display form, for instance . 

Thus the display processor 34 of the client terminal 3 accepts 
the selection input concerning the display form and display items from 

20 the user (here, the selection of the display item is indirect) (step 
S41 in Fig. 11), and outputs the data concerning the selected display 
form and display items to the client search interface 33. The client 
search interface 33 receives the data concerning the selected display 
form and display items, and transmits it to the search server 5 (step 

25 S43) . 

The data operating unit 52 of the search server 5 receives the 
data concerning the selected display items and display form, and 
temporarily stores it into the work memory area (step S45) . Then, it 
extracts the data of the documents corresponding to the selected display 
30 items from the document DB 53, for instance (step S47) . In a case where 
the previous processing result has still been held, it is also possible 
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to use the data. Because all of the related words except "peripherals" 
were selected, it specifies the documents, to which either of all of 
the related words displayed in Fig. 23 is registered as a related word, 
based on the data stored in the related word data storage 55, by the 
5 publication number, for instance. Then, it extracts the data of the 
documents from the publication numbers . Incidentally, it is also 
possible to carry out a processing based on the data of the documents 
extracted by the last processing, assuming that there is no change in 
the documents to be processed, in a case where all of the related words 

10 are selected in Fig. 24. Then, it generates display data to achieve the 
designated display form (here, "anchor map") by using the extracted data, 
and stores it into the work memory area (step S49) . 

The data operating unit 52 groups the extracted documents based 
on a specific matter (here, applicant name) (step S91 in Fig. 25) . That 

15 is, the documents are categorized by each applicant . Next, it calculates 
the degrees of relevancy between documents included in each group and 
each related word, and stores them into the work memory area, for 
instance (step S93) . Moreover, it is also possible to store them in the 
relevancy degree data storage 54. Then, it generates display data in 

20 which each group (here, "company A", "company B", and "company C") is 
represented by the data of the specific matter (applicant name) , and 
the degrees of relevancy between the documents included in each group 
and each related word are represented by the thickness of the segments 
between the data of the specific matter for the group and the related 

25 words in a format which the corresponding display processor 34 can 
process, and stores it into the work memory area, for instance (step 
S95) . 

Then, the search server 5 transmits the generated display data 
to the client terminal 3 (step S51 in Fig. 11). The client search 
30 interface 33 of the client terminal 3 receives the display data from 
the search server 5 (step S53) . Then, it confirms whether the display 
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processor 34 corresponding to the received display data has been 
activated. The display processor 34 is activated if it has not been 
activated, yet. Then, it outputs the display data to the display 
processor 34 . The display processor 34 corresponding to the display data 
5 displays the display data on the display device 31 in the display form 
designated by the user (step S55) . 

For instance, a screen shown in Fig. 26 is displayed. In Fig. 
26, selectable display items are "company A" 701, "company B" 702, and 
"company C" 703 that are the applicant name, and all of the related words . 

10 However, because any related word whose degree of relevancy is equal 
to or lower than a predetermined threshold, is not connected with any 
applicant name, the display is omitted. These display items may be 
buttons to make it selectable. Moreover, it is also possible to 
separately provide any button or the like to indicate the selection. 

15 Moreover, it is possible to designate a region to select any display 
items included in the region. In Fig, 26, the positions of the applicant 
names "company A" 701, "company B" 702, and "company C" 703 are fixed, 
the segments with the thickness representing the degree of relevancy 
with the related word radically extend from these applicant names to 

20 the related words . Because there is a related word associated with plural 
document sets , the related word is connected with plural applicant names 
by plural segments . 

Though it is not shown in Fig. 26, it is also possible to provide 
the display form designation column to designate the following display 

25 form on Fig. 26. That is, it is possible to designate plural display 
items and display form to cause them to be displayed in the following 
display form. 

As described above, by the direct transition from a certain 
display form to another display form, such as a graph, table, various 
30 kinds of flows, or the like, the analyst can investigate and analyze 
data without terminating his or her consideration on the way of his or 
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her analysis. Moreover, because the meanings of the aforementioned 
display forms that respectively represent the classification based on 
the degree of relevancy, transition based on a specific matter of the 
document content, or degree of relevancy along the time series, are 
5 different each other, this embodiment supports the analyst to progress 
the investigation and analysis in various viewpoints. 

In addition, because a range to be analyzed can be specified when 
shifting to another display form, it is possible to carry out associative 
investigation and analysis. Moreover, because when the range that the 

10 analyst wants to analyze is specified, the degree of relevancy or the 
like is newly calculated according to the documents within the range 
to display the results, the analyst's consideration can be supported 
by the newly displayed classification with higher accuracy. 

Though an embodiment of this invention was explained above, this 

15 invention is not limited to the aforementioned embodiment. For instance, 
though an example of the display form transition was explained using 
figures, it is not necessary to take such a route, and it is possible 
to select an arbitrary display form at any stage. The shif table display 
form may be limited by some reasons. However, it is possible to 

20 fundamentally shift to an arbitrary display form. 

Moreover, the functional blocks shown in Fig. 1 are mere examples, 
and they do not necessarily correspond to actual program modules. 
Moreover, though only one implementation example in the client-server 
environment was explained, it is also possible to carry out the 

25 aforementioned processing by only a terminal apparatus with the 
functions of the search server 5 and the client terminal 3 . Only the 
search function may be provided on the server. 

Moreover, the display examples are mere examples, therefore, it 
is not necessary to configure the aforementioned screen. 

30 In addition, the calculation method of the degree of relevancy 

is arbitrary, and as for the display forms, they are not limited to the 
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aforementioned ones. It is also possible to process the data of the 
documents by other processing algorithm to present it to the user. 

Although the present invention has been described with respect 
to a specific preferred embodiment thereof, various change and 
5 modifications may be suggested to one skilled in the art, and it is 
intended that the present invention encompass such changes and 
modifications as fall within the scope of the appended claims . 
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